Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 5000 |
| Missing cells | 589 |
| Missing cells (%) | 0.4% |
| Duplicate rows | 2 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 1020.6 KiB |
| Average record size in memory | 209.0 B |
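The two derived figures in the table above can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB = 1024 B; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 27
n_observations = 5000
missing_cells = 589
total_size_kib = 1020.6

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Under these assumptions both values round to the figures reported in the table.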
Variable types
| Numeric | 21 |
|---|---|
| Categorical | 3 |
| Boolean | 3 |
IsRetweet has constant value "False" | Constant |
Dataset has 2 (< 0.1%) duplicate rows | Duplicates |
Text has a high cardinality: 4962 distinct values | High cardinality |
SenderLocation has a high cardinality: 426 distinct values | High cardinality |
Id is highly correlated with Retweets# and 3 other fields | High correlation |
Retweets# is highly correlated with Id and 3 other fields | High correlation |
Favorites# is highly correlated with Id and 3 other fields | High correlation |
SenderId is highly correlated with SenderAccountYears and 1 other fields | High correlation |
SenderAccountYears is highly correlated with SenderId and 1 other fields | High correlation |
SenderFavorites# is highly correlated with SenderFollowings# and 1 other fields | High correlation |
SenderFollowings# is highly correlated with SenderFavorites# | High correlation |
SenderFollowers# is highly correlated with Retweets# and 2 other fields | High correlation |
SenderStatues# is highly correlated with SenderId and 3 other fields | High correlation |
Punctuations# is highly correlated with UpperCaseLetter# and 3 other fields | High correlation |
UpperCaseLetter# is highly correlated with Punctuations# and 2 other fields | High correlation |
Letter# is highly correlated with Punctuations# and 2 other fields | High correlation |
Words# is highly correlated with Punctuations# and 2 other fields | High correlation |
TWords# is highly correlated with Punctuations# and 3 other fields | High correlation |
UWords# is highly correlated with UpperCaseLetter# | High correlation |
SlangWords# is highly correlated with Id and 1 other fields | High correlation |
IsCyberbullying is highly correlated with Id and 3 other fields | High correlation |
Retweets# is highly correlated with Favorites# | High correlation |
Favorites# is highly correlated with Retweets# | High correlation |
Punctuations# is highly correlated with Letter# and 1 other fields | High correlation |
UpperCaseLetter# is highly correlated with TWords# and 1 other fields | High correlation |
Letter# is highly correlated with Punctuations# and 2 other fields | High correlation |
Words# is highly correlated with Punctuations# and 2 other fields | High correlation |
TWords# is highly correlated with UpperCaseLetter# and 3 other fields | High correlation |
UWords# is highly correlated with UpperCaseLetter# and 1 other fields | High correlation |
SlangWords# is highly correlated with IsCyberbullying | High correlation |
IsCyberbullying is highly correlated with SlangWords# | High correlation |
Id is highly correlated with IsCyberbullying | High correlation |
Retweets# is highly correlated with Favorites# and 2 other fields | High correlation |
Favorites# is highly correlated with Retweets# and 1 other fields | High correlation |
SenderId is highly correlated with SenderAccountYears | High correlation |
SenderAccountYears is highly correlated with SenderId | High correlation |
SenderFollowers# is highly correlated with Retweets# and 1 other fields | High correlation |
UpperCaseLetter# is highly correlated with TWords# | High correlation |
Letter# is highly correlated with Words# | High correlation |
Words# is highly correlated with Letter# | High correlation |
TWords# is highly correlated with UpperCaseLetter# | High correlation |
SlangWords# is highly correlated with IsCyberbullying | High correlation |
IsCyberbullying is highly correlated with Id and 2 other fields | High correlation |
IsCyberbullying is highly correlated with IsRetweet | High correlation |
IsRetweet is highly correlated with IsCyberbullying and 2 other fields | High correlation |
Medias# is highly correlated with IsRetweet | High correlation |
IsSelfMentioned is highly correlated with IsRetweet | High correlation |
Id is highly correlated with Retweets# | High correlation |
Retweets# is highly correlated with Id and 1 other fields | High correlation |
Favorites# is highly correlated with Retweets# | High correlation |
UpperCaseLetter# is highly correlated with TWords# and 1 other fields | High correlation |
Letter# is highly correlated with Words# and 1 other fields | High correlation |
Words# is highly correlated with Letter# and 1 other fields | High correlation |
TWords# is highly correlated with UpperCaseLetter# and 3 other fields | High correlation |
UWords# is highly correlated with UpperCaseLetter# and 1 other fields | High correlation |
SlangWords# is highly correlated with IsCyberbullying | High correlation |
IsCyberbullying is highly correlated with SlangWords# | High correlation |
SenderLocation has 364 (7.3%) missing values | Missing |
AvgWordLength has 170 (3.4%) missing values | Missing |
SenderAccountYears is highly skewed (γ1 = 32.60134166) | Skewed |
Text is uniformly distributed | Uniform |
Retweets# has 2862 (57.2%) zeros | Zeros |
Favorites# has 2457 (49.1%) zeros | Zeros |
Hashtags# has 4426 (88.5%) zeros | Zeros |
Mentions# has 3244 (64.9%) zeros | Zeros |
SenderAccountYears has 1201 (24.0%) zeros | Zeros |
SenderFavorites# has 175 (3.5%) zeros | Zeros |
SenderFollowings# has 130 (2.6%) zeros | Zeros |
SenderFollowers# has 132 (2.6%) zeros | Zeros |
Emojis# has 4286 (85.7%) zeros | Zeros |
Punctuations# has 1954 (39.1%) zeros | Zeros |
UpperCaseLetter# has 668 (13.4%) zeros | Zeros |
Symbols# has 4771 (95.4%) zeros | Zeros |
TWords# has 675 (13.5%) zeros | Zeros |
UWords# has 4394 (87.9%) zeros | Zeros |
SlangWords# has 2206 (44.1%) zeros | Zeros |
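The Skewed and Zeros alerts above rest on two simple quantities: the skewness coefficient γ1 and the share of zero values. A minimal sketch of both follows, assuming the plain moment-based (uncorrected) skewness; the report's own estimator may apply a bias correction, so results on real data can differ slightly. The function names and the toy data are hypothetical and not taken from the dataset.

```python
def population_skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5, without bias correction."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5


def zero_share(values):
    """Percentage of entries that are exactly zero."""
    return 100 * sum(1 for x in values if x == 0) / len(values)


# Toy data: small values plus one extreme outlier, which pulls the
# skewness strongly positive.
toy = [0, 0, 1, 1, 2, 3, 2020]
print(population_skewness(toy))
print(zero_share(toy))
```

A single extreme value is enough to produce a large positive γ1, which is one possible reading of the SenderAccountYears alert given its maximum of 2020.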
Reproduction
| Analysis started | 2022-01-04 08:50:02.445036 |
|---|---|
| Analysis finished | 2022-01-04 08:51:50.596895 |
| Duration | 1 minute and 48.15 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
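The reported duration can be recomputed from the two timestamps in the table above. This is a small sketch using only the standard library; the format string is an assumption about how the timestamps are laid out.

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2022-01-04 08:50:02.445036", FMT)
finished = datetime.strptime("2022-01-04 08:51:50.596895", FMT)

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```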
Id
| Distinct | 1465 |
|---|---|
| Distinct (%) | 29.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.149112634 × 10^18 |
| Minimum | 1292782403 |
|---|---|
| Maximum | 1.2073 × 10^18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 1292782403 |
|---|---|
| 5-th percentile | 1.085241 × 10^18 |
| Q1 | 1.15814 × 10^18 |
| median | 1.15927 × 10^18 |
| Q3 | 1.16201 × 10^18 |
| 95-th percentile | 1.20725 × 10^18 |
| Maximum | 1.2073 × 10^18 |
| Range | 1.207299999 × 10^18 |
| Interquartile range (IQR) | 3.87 × 10^15 |
Descriptive statistics
| Standard deviation | 8.370279448 × 10^16 |
|---|---|
| Coefficient of variation (CV) | 0.07284124462 |
| Kurtosis | 78.85631493 |
| Mean | 1.149112634 × 10^18 |
| Median Absolute Deviation (MAD) | 1.93 × 10^15 |
| Skewness | -7.825538008 |
| Sum | 5.745563171 × 10^21 |
| Variance | 7.006157804 × 10^33 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.15839 × 10^18 | 168 | 3.4% |
| 1.15843 × 10^18 | 149 | 3.0% |
| 1.15838 × 10^18 | 103 | 2.1% |
| 1.20728 × 10^18 | 96 | 1.9% |
| 1.1584 × 10^18 | 86 | 1.7% |
| 1.20727 × 10^18 | 63 | 1.3% |
| 1.15844 × 10^18 | 49 | 1.0% |
| 1.20725 × 10^18 | 34 | 0.7% |
| 1.20726 × 10^18 | 30 | 0.6% |
| 1.15837 × 10^18 | 28 | 0.6% |
| Other values (1455) | 4194 |
| Value | Count | Frequency (%) |
| 1292782403 | 1 | < 0.1% |
| 2.288804507 × 10^10 | 1 | < 0.1% |
| 2.664014936 × 10^10 | 1 | < 0.1% |
| 2.54852 × 10^16 | 1 | < 0.1% |
| 6.91767 × 10^16 | 1 | < 0.1% |
| 7.30291 × 10^16 | 1 | < 0.1% |
| 8.1419 × 10^16 | 1 | < 0.1% |
| 1.07941 × 10^17 | 1 | < 0.1% |
| 1.32155 × 10^17 | 1 | < 0.1% |
| 1.54195 × 10^17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.2073 × 10^18 | 22 | 0.4% |
| 1.20729 × 10^18 | 15 | 0.3% |
| 1.20728 × 10^18 | 96 | 1.9% |
| 1.20727 × 10^18 | 63 | 1.3% |
| 1.20726 × 10^18 | 30 | 0.6% |
| 1.20725 × 10^18 | 34 | 0.7% |
| 1.20724 × 10^18 | 14 | 0.3% |
| 1.20723 × 10^18 | 24 | 0.5% |
| 1.20722 × 10^18 | 9 | 0.2% |
| 1.20721 × 10^18 | 13 | 0.3% |
Text
| Distinct | 4962 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| merhaba ben kayseri den travesti hasret sevda görüşmeleri mi kendime ait evimde yapıyorum ne aradıgını biliyorsan ve ciddiysen eğer görüşelim müsaitim şuan 0541 691 29 19 bayan degilim 0541 691 29 19 | 5 |
|---|---|
| piç herif | 3 |
| neşet ertaş elini kalbine götürdü burası varya dedi taşa toprağa gerzek kalmadan insanın gömüldüğü tek yer | 3 |
| hoşt | 3 |
| bursatravesti sınırsız oldu bitti yok bursa altıparmak travesti afra 05366906903 | 3 |
| Other values (4957) |
Length
| Max length | 320 |
|---|---|
| Median length | 77 |
| Mean length | 103.2358 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 4932 |
|---|---|
| Unique (%) | 98.6% |
Sample
| 1st row | bir adam yanında çocuklaşan kadını fazladan sevmeli çünkü bu yalnızken hep güçlü göründüm izninle huzur bulduğum yerde biraz şımarmak istiyorum deme şeklidir perşembe |
|---|---|
| 2nd row | mağlup mu desem mahcup mu ama ikisi de değil ben garip sen güzel dünya umutlu öyle bir tuhafım bu akşamüstü sevgilim canavar götünü gibi iki yanım iki süngü ahmed arif perşembe |
| 3rd row | günaydın iyi pazarlar allah acil şifalar versin inşallah daha da iyi olacak |
| 4th row | ve ahmet arif leyla sına seslenir sevdiğim çaresizliğimden gayrı hiçbir kabahatim yok benim aşına ekmeğine kahrına karanlığına özlemine umuduna kat beni pazar hayalettimde |
| 5th row | arkadaki sanal gerzek oyunun oynuyor |
Common Values
| Value | Count | Frequency (%) |
| merhaba ben kayseri den travesti hasret sevda görüşmeleri mi kendime ait evimde yapıyorum ne aradıgını biliyorsan ve ciddiysen eğer görüşelim müsaitim şuan 0541 691 29 19 bayan degilim 0541 691 29 19 | 5 | 0.1% |
| piç herif | 3 | 0.1% |
| neşet ertaş elini kalbine götürdü burası varya dedi taşa toprağa gerzek kalmadan insanın gömüldüğü tek yer | 3 | 0.1% |
| hoşt | 3 | 0.1% |
| bursatravesti sınırsız oldu bitti yok bursa altıparmak travesti afra 05366906903 | 3 | 0.1% |
| siz kahpe ölmediyseniz bize dertten bir şey olmaz merak etmeyin | 3 | 0.1% |
| interpol ün aradığı cip suriye ye götünü istenirken ele geçirildi | 2 | < 0.1% |
| ben çevrem genişlesin diye orospu çocuklarının yüzüne gülmem | 2 | < 0.1% |
| gunaydın | 2 | < 0.1% |
| orospu evladı | 2 | < 0.1% |
| Other values (4952) | 4972 |
Length
| Value | Count | Frequency (%) |
| bir | 970 | 1.4% |
| bu | 776 | 1.1% |
| ve | 686 | 1.0% |
| ne | 527 | 0.7% |
| da | 433 | 0.6% |
| de | 397 | 0.6% |
| için | 387 | 0.6% |
| aq | 351 | 0.5% |
| ben | 322 | 0.5% |
| var | 316 | 0.4% |
| Other values (23663) | 65190 |
IsRetweet
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | 0.1% |
| Memory size | 39.2 KiB |
| False | |
|---|---|
| (Missing) | 5 |
| Value | Count | Frequency (%) |
| False | 4995 | |
| (Missing) | 5 | 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | 0.1% |
| Memory size | 39.2 KiB |
| False | |
|---|---|
| True | 4 |
| (Missing) | 6 |
| Value | Count | Frequency (%) |
| False | 4990 | |
| True | 4 | 0.1% |
| (Missing) | 6 | 0.1% |
Retweets#
Real number (ℝ≥0)
HIGH CORRELATION (×4), ZEROS
| Distinct | 471 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 33 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 132.1044896 |
| Minimum | 0 |
|---|---|
| Maximum | 29086 |
| Zeros | 2862 |
| Zeros (%) | 57.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 7 |
| 95-th percentile | 302.7 |
| Maximum | 29086 |
| Range | 29086 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 1078.051319 |
|---|---|
| Coefficient of variation (CV) | 8.160595613 |
| Kurtosis | 368.8622646 |
| Mean | 132.1044896 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 17.41103037 |
| Sum | 656163 |
| Variance | 1162194.645 |
| Monotonicity | Not monotonic |
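Several of the Retweets# figures above are derived from one another, which makes them easy to sanity-check. The sketch below assumes that the mean is taken over non-missing values only and that the variance is the square of the reported standard deviation; the inputs are copied from the tables, and the variable names are illustrative.

```python
# Inputs copied from the Retweets# tables.
n_observations = 5000
missing = 33
total = 656163           # Sum
std = 1078.051319        # Standard deviation
q1, q3 = 0, 7
minimum, maximum = 0, 29086

# Assumption: missing values are excluded before averaging.
n_valid = n_observations - missing
mean = total / n_valid

cv = std / mean              # coefficient of variation
variance = std ** 2
iqr = q3 - q1                # interquartile range
value_range = maximum - minimum

print(mean, cv, variance, iqr, value_range)
```

Under these assumptions the recomputed mean, CV, variance, IQR and range agree with the reported values up to rounding.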
| Value | Count | Frequency (%) |
| 0 | 2862 | 57.2% |
| 1 | 412 | 8.2% |
| 2 | 149 | 3.0% |
| 3 | 103 | 2.1% |
| 4 | 65 | 1.3% |
| 5 | 64 | 1.3% |
| 7 | 54 | 1.1% |
| 6 | 50 | 1.0% |
| 10 | 39 | 0.8% |
| 8 | 37 | 0.7% |
| Other values (461) | 1132 | 22.6% |
| (Missing) | 33 | 0.7% |
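The Frequency (%) column appears to be the count divided by the total number of observations (5000), shown to one decimal place, with very small non-zero shares printed as "< 0.1%". The helper below is a hypothetical reconstruction of that display rule, inferred only from rows in this report (for example, a count of 4 is shown as 0.1% while a count of 2 is shown as < 0.1%); it is not taken from the profiling tool's source.

```python
def format_share(count, total):
    """Format count/total as a percentage string in the style of the report.

    Assumption: shares that would round to 0.0% but are non-zero are
    displayed as "< 0.1%".
    """
    pct = 100 * count / total
    if 0 < pct < 0.05:
        return "< 0.1%"
    return f"{pct:.1f}%"


print(format_share(412, 5000))
print(format_share(33, 5000))
print(format_share(2, 5000))
```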
| Value | Count | Frequency (%) |
| 0 | 2862 | 57.2% |
| 1 | 412 | 8.2% |
| 2 | 149 | 3.0% |
| 3 | 103 | 2.1% |
| 4 | 65 | 1.3% |
| 5 | 64 | 1.3% |
| 6 | 50 | 1.0% |
| 7 | 54 | 1.1% |
| 8 | 37 | 0.7% |
| 9 | 22 | 0.4% |
| Value | Count | Frequency (%) |
| 29086 | 1 | |
| 28735 | 1 | |
| 24517 | 1 | |
| 20530 | 1 | |
| 19648 | 1 | |
| 19255 | 1 | |
| 19074 | 1 | |
| 15996 | 1 | |
| 15279 | 1 | |
| 11868 | 1 |
Favorites#
Real number (ℝ≥0)
HIGH CORRELATION (×4), ZEROS
| Distinct | 884 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 733.0774 |
| Minimum | 0 |
|---|---|
| Maximum | 163984 |
| Zeros | 2457 |
| Zeros (%) | 49.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 44 |
| 95-th percentile | 2388.5 |
| Maximum | 163984 |
| Range | 163984 |
| Interquartile range (IQR) | 44 |
Descriptive statistics
| Standard deviation | 4955.513189 |
|---|---|
| Coefficient of variation (CV) | 6.759877183 |
| Kurtosis | 385.8352734 |
| Mean | 733.0774 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 16.6144198 |
| Sum | 3665387 |
| Variance | 24557110.97 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2457 | 49.1% |
| 1 | 382 | 7.6% |
| 2 | 126 | 2.5% |
| 3 | 94 | 1.9% |
| 4 | 67 | 1.3% |
| 5 | 48 | 1.0% |
| 6 | 36 | 0.7% |
| 7 | 36 | 0.7% |
| 8 | 35 | 0.7% |
| 16 | 29 | 0.6% |
| Other values (874) | 1690 |
| Value | Count | Frequency (%) |
| 0 | 2457 | 49.1% |
| 1 | 382 | 7.6% |
| 2 | 126 | 2.5% |
| 3 | 94 | 1.9% |
| 4 | 67 | 1.3% |
| 5 | 48 | 1.0% |
| 6 | 36 | 0.7% |
| 7 | 36 | 0.7% |
| 8 | 35 | 0.7% |
| 9 | 28 | 0.6% |
| Value | Count | Frequency (%) |
| 163984 | 1 | |
| 113898 | 1 | |
| 94126 | 1 | |
| 86375 | 1 | |
| 84372 | 1 | |
| 81212 | 1 | |
| 70262 | 1 | |
| 57185 | 1 | |
| 51518 | 1 | |
| 46465 | 1 |
Hashtags#
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2032 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 4426 |
| Zeros (%) | 88.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.932139226 |
|---|---|
| Coefficient of variation (CV) | 4.587299341 |
| Kurtosis | 221.4808724 |
| Mean | 0.2032 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.4027872 |
| Sum | 1016 |
| Variance | 0.8688835367 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4426 | 88.5% |
| 1 | 399 | 8.0% |
| 2 | 95 | 1.9% |
| 3 | 42 | 0.8% |
| 4 | 11 | 0.2% |
| 5 | 8 | 0.2% |
| 7 | 4 | 0.1% |
| 6 | 3 | 0.1% |
| 9 | 2 | < 0.1% |
| 18 | 2 | < 0.1% |
| Other values (8) | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 4426 | 88.5% |
| 1 | 399 | 8.0% |
| 2 | 95 | 1.9% |
| 3 | 42 | 0.8% |
| 4 | 11 | 0.2% |
| 5 | 8 | 0.2% |
| 6 | 3 | 0.1% |
| 7 | 4 | 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 23 | 1 | |
| 20 | 1 | |
| 18 | 2 | |
| 17 | 1 | |
| 16 | 1 | |
| 12 | 1 | |
| 11 | 1 | |
| 10 | 1 | |
| 9 | 2 | |
| 8 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 88 |
| 4 | 67 |
| 3 | 24 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3920 | |
| 1 | 901 | 18.0% |
| 2 | 88 | 1.8% |
| 4 | 67 | 1.3% |
| 3 | 24 | 0.5% |
Length
| Value | Count | Frequency (%) |
| 0 | 3920 | |
| 1 | 901 | 18.0% |
| 2 | 88 | 1.8% |
| 4 | 67 | 1.3% |
| 3 | 24 | 0.5% |
Mentions#
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5614 |
| Minimum | 0 |
|---|---|
| Maximum | 50 |
| Zeros | 3244 |
| Zeros (%) | 64.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.654804434 |
|---|---|
| Coefficient of variation (CV) | 2.947638821 |
| Kurtosis | 468.6157982 |
| Mean | 0.5614 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 17.74017422 |
| Sum | 2807 |
| Variance | 2.738377716 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3244 | 64.9% |
| 1 | 1294 | 25.9% |
| 2 | 286 | 5.7% |
| 3 | 102 | 2.0% |
| 4 | 27 | 0.5% |
| 5 | 12 | 0.2% |
| 6 | 9 | 0.2% |
| 10 | 4 | 0.1% |
| 8 | 4 | 0.1% |
| 7 | 3 | 0.1% |
| Other values (13) | 15 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 3244 | 64.9% |
| 1 | 1294 | 25.9% |
| 2 | 286 | 5.7% |
| 3 | 102 | 2.0% |
| 4 | 27 | 0.5% |
| 5 | 12 | 0.2% |
| 6 | 9 | 0.2% |
| 7 | 3 | 0.1% |
| 8 | 4 | 0.1% |
| 9 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 50 | 1 | |
| 49 | 1 | |
| 48 | 1 | |
| 29 | 1 | |
| 19 | 1 | |
| 17 | 1 | |
| 16 | 1 | |
| 15 | 1 | |
| 14 | 1 | |
| 13 | 1 |
SenderId
| Distinct | 4167 |
|---|---|
| Distinct (%) | 83.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.62601359 × 10^17 |
| Minimum | 3696241 |
|---|---|
| Maximum | 1.20707 × 10^18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 3696241 |
|---|---|
| 5-th percentile | 124726283 |
| Q1 | 1208505124 |
| median | 8.02847 × 10^17 |
| Q3 | 1.06708 × 10^18 |
| 95-th percentile | 1.1583415 × 10^18 |
| Maximum | 1.20707 × 10^18 |
| Range | 1.20707 × 10^18 |
| Interquartile range (IQR) | 1.067079999 × 10^18 |
Descriptive statistics
| Standard deviation | 5.140665155 × 10^17 |
|---|---|
| Coefficient of variation (CV) | 0.9137313788 |
| Kurtosis | -1.873239105 |
| Mean | 5.62601359 × 10^17 |
| Median Absolute Deviation (MAD) | 3.54858 × 10^17 |
| Skewness | -0.1152238405 |
| Sum | 2.813006795 × 10^21 |
| Variance | 2.642643823 × 10^35 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4604138494 | 26 | 0.5% |
| 1.09486 × 10^18 | 22 | 0.4% |
| 1.00988 × 10^18 | 15 | 0.3% |
| 23186079 | 14 | 0.3% |
| 423805915 | 10 | 0.2% |
| 68034431 | 9 | 0.2% |
| 1.09997 × 10^18 | 9 | 0.2% |
| 7.33373 × 10^17 | 8 | 0.2% |
| 71108476 | 8 | 0.2% |
| 1.19244 × 10^18 | 8 | 0.2% |
| Other values (4157) | 4871 |
| Value | Count | Frequency (%) |
| 3696241 | 1 | |
| 4495931 | 1 | |
| 15016209 | 1 | |
| 15883237 | 1 | |
| 16310319 | 1 | |
| 16626956 | 1 | |
| 18871213 | 1 | |
| 19149088 | 1 | |
| 19942168 | 2 | |
| 20672583 | 1 |
| Value | Count | Frequency (%) |
| 1.20707 × 10^18 | 1 | < 0.1% |
| 1.20704 × 10^18 | 1 | < 0.1% |
| 1.20703 × 10^18 | 1 | < 0.1% |
| 1.207 × 10^18 | 1 | < 0.1% |
| 1.20664 × 10^18 | 1 | < 0.1% |
| 1.20655 × 10^18 | 1 | < 0.1% |
| 1.20653 × 10^18 | 4 | 0.1% |
| 1.20634 × 10^18 | 1 | < 0.1% |
| 1.2056 × 10^18 | 1 | < 0.1% |
| 1.20439 × 10^18 | 1 | < 0.1% |
SenderAccountYears
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.342 |
| Minimum | 0 |
|---|---|
| Maximum | 2020 |
| Zeros | 1201 |
| Zeros (%) | 24.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 2020 |
| Range | 2020 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 59.55636681 |
|---|---|
| Coefficient of variation (CV) | 11.14870214 |
| Kurtosis | 1078.461339 |
| Mean | 5.342 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 32.60134166 |
| Sum | 26710 |
| Variance | 3546.960828 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1201 | 24.0% |
| 1 | 790 | |
| 2 | 493 | |
| 8 | 374 | 7.5% |
| 3 | 366 | 7.3% |
| 6 | 361 | 7.2% |
| 7 | 357 | 7.1% |
| 4 | 329 | 6.6% |
| 5 | 283 | 5.7% |
| 9 | 250 | 5.0% |
| Other values (5) | 196 | 3.9% |
| Value | Count | Frequency (%) |
| 0 | 1201 | 24.0% |
| 1 | 790 | |
| 2 | 493 | |
| 3 | 366 | 7.3% |
| 4 | 329 | 6.6% |
| 5 | 283 | 5.7% |
| 6 | 361 | 7.2% |
| 7 | 357 | 7.1% |
| 8 | 374 | 7.5% |
| 9 | 250 | 5.0% |
| Value | Count | Frequency (%) |
| 2020 | 4 | 0.1% |
| 1200 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 4 | 0.1% |
| 10 | 185 | |
| 9 | 250 | 5.0% |
| 8 | 374 | 7.5% |
| 7 | 357 | 7.1% |
| 6 | 361 | 7.2% |
| 5 | 283 | 5.7% |
SenderFavorites#
| Distinct | 3356 |
|---|---|
| Distinct (%) | 67.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20832.5534 |
| Minimum | 0 |
|---|---|
| Maximum | 814597 |
| Zeros | 175 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 575.25 |
| median | 3871 |
| Q3 | 16760.25 |
| 95-th percentile | 91890 |
| Maximum | 814597 |
| Range | 814597 |
| Interquartile range (IQR) | 16185 |
Descriptive statistics
| Standard deviation | 53253.03531 |
|---|---|
| Coefficient of variation (CV) | 2.556241393 |
| Kurtosis | 49.69667545 |
| Mean | 20832.5534 |
| Median Absolute Deviation (MAD) | 3815 |
| Skewness | 5.9199112 |
| Sum | 104162767 |
| Variance | 2835885770 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 175 | 3.5% |
| 1 | 34 | 0.7% |
| 2 | 33 | 0.7% |
| 4 | 29 | 0.6% |
| 357 | 28 | 0.6% |
| 1603 | 22 | 0.4% |
| 5 | 20 | 0.4% |
| 7 | 18 | 0.4% |
| 3 | 16 | 0.3% |
| 67 | 16 | 0.3% |
| Other values (3346) | 4609 |
| Value | Count | Frequency (%) |
| 0 | 175 | 3.5% |
| 1 | 34 | 0.7% |
| 2 | 33 | 0.7% |
| 3 | 16 | 0.3% |
| 4 | 29 | 0.6% |
| 5 | 20 | 0.4% |
| 6 | 11 | 0.2% |
| 7 | 18 | 0.4% |
| 8 | 12 | 0.2% |
| 9 | 14 | 0.3% |
| Value | Count | Frequency (%) |
| 814597 | 2 | |
| 687355 | 1 | |
| 640750 | 1 | |
| 564549 | 1 | |
| 514491 | 1 | |
| 509636 | 1 | |
| 489252 | 1 | |
| 480456 | 1 | |
| 445796 | 2 | |
| 442057 | 1 |
SenderFollowings#
| Distinct | 1659 |
|---|---|
| Distinct (%) | 33.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6714.929 |
| Minimum | 0 |
|---|---|
| Maximum | 1658816 |
| Zeros | 130 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 97 |
| median | 297 |
| Q3 | 955 |
| 95-th percentile | 8675.05 |
| Maximum | 1658816 |
| Range | 1658816 |
| Interquartile range (IQR) | 858 |
Descriptive statistics
| Standard deviation | 65110.99435 |
|---|---|
| Coefficient of variation (CV) | 9.696453135 |
| Kurtosis | 237.7327406 |
| Mean | 6714.929 |
| Median Absolute Deviation (MAD) | 248 |
| Skewness | 14.5559893 |
| Sum | 33574645 |
| Variance | 4239441586 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 130 | 2.6% |
| 12 | 100 | 2.0% |
| 149 | 29 | 0.6% |
| 2 | 28 | 0.6% |
| 95 | 26 | 0.5% |
| 506 | 24 | 0.5% |
| 50 | 21 | 0.4% |
| 4 | 21 | 0.4% |
| 71 | 20 | 0.4% |
| 5 | 19 | 0.4% |
| Other values (1649) | 4582 |
| Value | Count | Frequency (%) |
| 0 | 130 | 2.6% |
| 1 | 19 | 0.4% |
| 2 | 28 | 0.6% |
| 3 | 19 | 0.4% |
| 4 | 21 | 0.4% |
| 5 | 19 | 0.4% |
| 6 | 15 | 0.3% |
| 7 | 13 | 0.3% |
| 8 | 12 | 0.2% |
| 9 | 13 | 0.3% |
| Value | Count | Frequency (%) |
| 1658816 | 1 | < 0.1% |
| 1163262 | 4 | |
| 861745 | 1 | < 0.1% |
| 797780 | 6 | |
| 795025 | 3 | |
| 735579 | 4 | |
| 699283 | 7 | |
| 557625 | 2 | < 0.1% |
| 377143 | 1 | < 0.1% |
| 284444 | 1 | < 0.1% |
SenderFollowers#
| Distinct | 2262 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 5 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 109699.399 |
| Minimum | 0 |
|---|---|
| Maximum | 13974880 |
| Zeros | 132 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 123.5 |
| median | 616 |
| Q3 | 4881 |
| 95-th percentile | 207664 |
| Maximum | 13974880 |
| Range | 13974880 |
| Interquartile range (IQR) | 4757.5 |
Descriptive statistics
| Standard deviation | 836945.1911 |
|---|---|
| Coefficient of variation (CV) | 7.6294419 |
| Kurtosis | 171.4437323 |
| Mean | 109699.399 |
| Median Absolute Deviation (MAD) | 594 |
| Skewness | 12.3426694 |
| Sum | 547948498 |
| Variance | 7.00477253 × 10^11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 132 | 2.6% |
| 1 | 29 | 0.6% |
| 640701 | 26 | 0.5% |
| 360 | 25 | 0.5% |
| 46 | 25 | 0.5% |
| 10 | 24 | 0.5% |
| 4 | 23 | 0.5% |
| 2 | 23 | 0.5% |
| 19 | 20 | 0.4% |
| 6 | 20 | 0.4% |
| Other values (2252) | 4648 |
| Value | Count | Frequency (%) |
| 0 | 132 | 2.6% |
| 1 | 29 | 0.6% |
| 2 | 23 | 0.5% |
| 3 | 20 | 0.4% |
| 4 | 23 | 0.5% |
| 5 | 18 | 0.4% |
| 6 | 20 | 0.4% |
| 7 | 19 | 0.4% |
| 8 | 12 | 0.2% |
| 9 | 16 | 0.3% |
| Value | Count | Frequency (%) |
| 13974880 | 9 | |
| 8631496 | 14 | |
| 7131754 | 1 | < 0.1% |
| 6731254 | 1 | < 0.1% |
| 6703994 | 6 | |
| 6508178 | 1 | < 0.1% |
| 4930135 | 3 | 0.1% |
| 4895207 | 1 | < 0.1% |
| 3636998 | 2 | < 0.1% |
| 3319323 | 1 | < 0.1% |
| Distinct | 3004 |
|---|---|
| Distinct (%) | 60.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10745.4382 |
| Minimum | 0 |
|---|---|
| Maximum | 986428 |
| Zeros | 4 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 394 |
| median | 1978 |
| Q3 | 7052 |
| 95-th percentile | 43969.6 |
| Maximum | 986428 |
| Range | 986428 |
| Interquartile range (IQR) | 6658 |
Descriptive statistics
| Standard deviation | 38135.94718 |
|---|---|
| Coefficient of variation (CV) | 3.549036016 |
| Kurtosis | 213.8727319 |
| Mean | 10745.4382 |
| Median Absolute Deviation (MAD) | 1851 |
| Skewness | 11.75945751 |
| Sum | 53727191 |
| Variance | 1454350468 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 42 | 0.8% |
| 1916 | 26 | 0.5% |
| 2 | 23 | 0.5% |
| 2341 | 22 | 0.4% |
| 13 | 18 | 0.4% |
| 3 | 18 | 0.4% |
| 17 | 17 | 0.3% |
| 14 | 16 | 0.3% |
| 1359 | 15 | 0.3% |
| 6 | 14 | 0.3% |
| Other values (2994) | 4789 |
| Value | Count | Frequency (%) |
| 0 | 4 | 0.1% |
| 1 | 42 | 0.8% |
| 2 | 23 | 0.5% |
| 3 | 18 | 0.4% |
| 4 | 14 | 0.3% |
| 5 | 10 | 0.2% |
| 6 | 14 | 0.3% |
| 7 | 9 | 0.2% |
| 8 | 12 | 0.2% |
| 9 | 13 | 0.3% |
| Value | Count | Frequency (%) |
| 986428 | 2 | < 0.1% |
| 526044 | 1 | < 0.1% |
| 477992 | 1 | < 0.1% |
| 453677 | 1 | < 0.1% |
| 422586 | 1 | < 0.1% |
| 410667 | 2 | < 0.1% |
| 339867 | 8 | |
| 332369 | 1 | < 0.1% |
| 330094 | 1 | < 0.1% |
| 325380 | 1 | < 0.1% |
SenderLocation
| Distinct | 426 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 364 |
| Missing (%) | 7.3% |
| Memory size | 39.2 KiB |
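The percentages in the table above use two different denominators, judging by the numbers themselves: the missing share is consistent with all 5000 observations, while the distinct share is consistent with only the non-missing values. This is an inference from the reported figures, not a statement about the profiling tool's documented behaviour. A short check, with illustrative variable names:

```python
# Inputs copied from the overview table above.
n_observations = 5000
missing = 364
distinct = 426

n_valid = n_observations - missing             # non-missing values
missing_pct = 100 * missing / n_observations   # share of all rows
# Assumption: distinct % is relative to non-missing values only.
distinct_pct = 100 * distinct / n_valid

print(f"Missing (%): {missing_pct:.1f}%")
print(f"Distinct (%): {distinct_pct:.1f}%")
```

Dividing the distinct count by all 5000 rows instead would give about 8.5%, which does not match the reported figure.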
| türkiye | |
|---|---|
| ankara | |
| istanbul | |
| bursa | 226 |
| antalya | 201 |
| Other values (421) |
Length
| Max length | 31 |
|---|---|
| Median length | 7 |
| Mean length | 7.114969802 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 271 |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | na |
|---|---|
| 2nd row | na |
| 3rd row | turkey |
| 4th row | istanbul |
| 5th row | turkey |
Common Values
| Value | Count | Frequency (%) |
| türkiye | 798 | |
| ankara | 613 | 12.3% |
| istanbul | 374 | 7.5% |
| bursa | 226 | 4.5% |
| antalya | 201 | 4.0% |
| adana | 161 | 3.2% |
| turkey | 141 | 2.8% |
| mersin | 103 | 2.1% |
| eskişehir | 102 | 2.0% |
| samsun | 82 | 1.6% |
| Other values (416) | 1835 | |
| (Missing) | 364 | 7.3% |
Length
| Value | Count | Frequency (%) |
| türkiye | 798 | |
| ankara | 613 | 13.2% |
| istanbul | 374 | 8.1% |
| bursa | 226 | 4.9% |
| antalya | 201 | 4.3% |
| adana | 161 | 3.5% |
| turkey | 141 | 3.0% |
| mersin | 103 | 2.2% |
| eskişehir | 102 | 2.2% |
| samsun | 82 | 1.8% |
| Other values (416) | 1835 |
Emojis#
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 6 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3964757709 |
| Minimum | 0 |
|---|---|
| Maximum | 55 |
| Zeros | 4286 |
| Zeros (%) | 85.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.683259428 |
|---|---|
| Coefficient of variation (CV) | 4.245554335 |
| Kurtosis | 288.7410113 |
| Mean | 0.3964757709 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.73367316 |
| Sum | 1980 |
| Variance | 2.833362302 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4286 | 85.7% |
| 1 | 315 | 6.3% |
| 2 | 145 | 2.9% |
| 3 | 100 | 2.0% |
| 4 | 61 | 1.2% |
| 6 | 23 | 0.5% |
| 5 | 15 | 0.3% |
| 9 | 9 | 0.2% |
| 12 | 8 | 0.2% |
| 8 | 7 | 0.1% |
| Other values (11) | 25 | 0.5% |
| (Missing) | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4286 | 85.7% |
| 1 | 315 | 6.3% |
| 2 | 145 | 2.9% |
| 3 | 100 | 2.0% |
| 4 | 61 | 1.2% |
| 5 | 15 | 0.3% |
| 6 | 23 | 0.5% |
| 7 | 5 | 0.1% |
| 8 | 7 | 0.1% |
| 9 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 29 | 2 | < 0.1% |
| 28 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 4 | 0.1% |
| 12 | 8 | 0.2% |
| 11 | 3 | 0.1% |
Punctuations#
| Distinct | 37 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5278 |
| Minimum | 0 |
| Maximum | 84 |
| Zeros | 1954 |
| Zeros (%) | 39.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 9 |
| Maximum | 84 |
| Range | 84 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 4.033122906 |
|---|---|
| Coefficient of variation (CV) | 1.595507123 |
| Kurtosis | 83.91174302 |
| Mean | 2.5278 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 6.076941003 |
| Sum | 12639 |
| Variance | 16.26608038 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1954 | 39.1% |
| 1 | 765 | 15.3% |
| 2 | 546 | 10.9% |
| 3 | 428 | 8.6% |
| 4 | 334 | 6.7% |
| 5 | 231 | 4.6% |
| 6 | 184 | 3.7% |
| 7 | 149 | 3.0% |
| 8 | 108 | 2.2% |
| 9 | 77 | 1.5% |
| Other values (27) | 224 | 4.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1954 | 39.1% |
| 1 | 765 | 15.3% |
| 2 | 546 | 10.9% |
| 3 | 428 | 8.6% |
| 4 | 334 | 6.7% |
| 5 | 231 | 4.6% |
| 6 | 184 | 3.7% |
| 7 | 149 | 3.0% |
| 8 | 108 | 2.2% |
| 9 | 77 | 1.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 84 | 2 | < 0.1% |
| 53 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 36 | 3 | 0.1% |
| 34 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
UpperCaseLetter#
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 80 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.274 |
| Minimum | 0 |
| Maximum | 239 |
| Zeros | 668 |
| Zeros (%) | 13.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 16 |
| Maximum | 239 |
| Range | 239 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 10.68225379 |
|---|---|
| Coefficient of variation (CV) | 2.499357462 |
| Kurtosis | 171.57481 |
| Mean | 4.274 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 10.45974138 |
| Sum | 21370 |
| Variance | 114.1105461 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2062 | 41.2% |
| 0 | 668 | 13.4% |
| 2 | 587 | 11.7% |
| 3 | 305 | 6.1% |
| 4 | 269 | 5.4% |
| 5 | 225 | 4.5% |
| 6 | 135 | 2.7% |
| 7 | 93 | 1.9% |
| 8 | 82 | 1.6% |
| 9 | 75 | 1.5% |
| Other values (70) | 499 | 10.0% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 668 | 13.4% |
| 1 | 2062 | 41.2% |
| 2 | 587 | 11.7% |
| 3 | 305 | 6.1% |
| 4 | 269 | 5.4% |
| 5 | 225 | 4.5% |
| 6 | 135 | 2.7% |
| 7 | 93 | 1.9% |
| 8 | 82 | 1.6% |
| 9 | 75 | 1.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 239 | 1 | < 0.1% |
| 235 | 1 | < 0.1% |
| 225 | 1 | < 0.1% |
| 195 | 1 | < 0.1% |
| 136 | 1 | < 0.1% |
| 129 | 1 | < 0.1% |
| 128 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 99 | 1 | < 0.1% |
Letter#
| Distinct | 244 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83.1662 |
| Minimum | 4 |
| Maximum | 249 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 35 |
| median | 63 |
| Q3 | 116 |
| 95-th percentile | 220 |
| Maximum | 249 |
| Range | 245 |
| Interquartile range (IQR) | 81 |
Descriptive statistics
| Standard deviation | 62.52691341 |
|---|---|
| Coefficient of variation (CV) | 0.7518308328 |
| Kurtosis | -0.08190286811 |
| Mean | 83.1662 |
| Median Absolute Deviation (MAD) | 34 |
| Skewness | 0.9921842616 |
| Sum | 415831 |
| Variance | 3909.614901 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 36 | 69 | 1.4% |
| 43 | 63 | 1.3% |
| 25 | 59 | 1.2% |
| 27 | 58 | 1.2% |
| 34 | 56 | 1.1% |
| 20 | 56 | 1.1% |
| 39 | 54 | 1.1% |
| 37 | 53 | 1.1% |
| 30 | 53 | 1.1% |
| 56 | 53 | 1.1% |
| Other values (234) | 4426 | 88.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 4 | 0.1% |
| 5 | 13 | 0.3% |
| 6 | 7 | 0.1% |
| 7 | 11 | 0.2% |
| 8 | 25 | 0.5% |
| 9 | 21 | 0.4% |
| 10 | 30 | 0.6% |
| 11 | 29 | 0.6% |
| 12 | 33 | 0.7% |
| 13 | 26 | 0.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 249 | 1 | < 0.1% |
| 247 | 1 | < 0.1% |
| 246 | 1 | < 0.1% |
| 245 | 1 | < 0.1% |
| 243 | 3 | 0.1% |
| 242 | 1 | < 0.1% |
| 241 | 6 | 0.1% |
| 240 | 7 | 0.1% |
| 239 | 6 | 0.1% |
| 238 | 9 | 0.2% |
Symbols#
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0826 |
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 4771 |
| Zeros (%) | 95.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4747866094 |
|---|---|
| Coefficient of variation (CV) | 5.748021906 |
| Kurtosis | 94.59551993 |
| Mean | 0.0826 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.580401114 |
| Sum | 413 |
| Variance | 0.2254223245 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4771 | 95.4% |
| 1 | 136 | 2.7% |
| 2 | 55 | 1.1% |
| 4 | 18 | 0.4% |
| 3 | 9 | 0.2% |
| 5 | 4 | 0.1% |
| 6 | 4 | 0.1% |
| 8 | 3 | 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4771 | 95.4% |
| 1 | 136 | 2.7% |
| 2 | 55 | 1.1% |
| 3 | 9 | 0.2% |
| 4 | 18 | 0.4% |
| 5 | 4 | 0.1% |
| 6 | 4 | 0.1% |
| 8 | 3 | 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 3 | 0.1% |
| 6 | 4 | 0.1% |
| 5 | 4 | 0.1% |
| 4 | 18 | 0.4% |
| 3 | 9 | 0.2% |
| 2 | 55 | 1.1% |
| 1 | 136 | 2.7% |
| 0 | 4771 | 95.4% |
Words#
| Distinct | 48 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.071 |
| Minimum | 1 |
| Maximum | 49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 11 |
| Q3 | 20 |
| 95-th percentile | 36 |
| Maximum | 49 |
| Range | 48 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.47002786 |
|---|---|
| Coefficient of variation (CV) | 0.7440855558 |
| Kurtosis | 0.1568164349 |
| Mean | 14.071 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.048269592 |
| Sum | 70355 |
| Variance | 109.6214833 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 354 | 7.1% |
| 5 | 334 | 6.7% |
| 7 | 292 | 5.8% |
| 4 | 291 | 5.8% |
| 8 | 274 | 5.5% |
| 9 | 271 | 5.4% |
| 10 | 240 | 4.8% |
| 3 | 238 | 4.8% |
| 11 | 215 | 4.3% |
| 12 | 191 | 3.8% |
| Other values (38) | 2300 | 46.0% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 54 | 1.1% |
| 2 | 140 | 2.8% |
| 3 | 238 | 4.8% |
| 4 | 291 | 5.8% |
| 5 | 334 | 6.7% |
| 6 | 354 | 7.1% |
| 7 | 292 | 5.8% |
| 8 | 274 | 5.5% |
| 9 | 271 | 5.4% |
| 10 | 240 | 4.8% |

| Value | Count | Frequency (%) |
|---|---|---|
| 49 | 2 | < 0.1% |
| 47 | 2 | < 0.1% |
| 46 | 5 | 0.1% |
| 45 | 5 | 0.1% |
| 44 | 7 | 0.1% |
| 43 | 12 | 0.2% |
| 42 | 26 | 0.5% |
| 41 | 29 | 0.6% |
| 40 | 31 | 0.6% |
| 39 | 36 | 0.7% |
TWords#
| Distinct | 35 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6872 |
| Minimum | 0 |
| Maximum | 40 |
| Zeros | 675 |
| Zeros (%) | 13.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 9 |
| Maximum | 40 |
| Range | 40 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.638406761 |
|---|---|
| Coefficient of variation (CV) | 1.353976913 |
| Kurtosis | 21.12780574 |
| Mean | 2.6872 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.679395601 |
| Sum | 13436 |
| Variance | 13.23800376 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2103 | 42.1% |
| 0 | 675 | 13.5% |
| 2 | 672 | 13.4% |
| 3 | 388 | 7.8% |
| 4 | 291 | 5.8% |
| 5 | 207 | 4.1% |
| 6 | 145 | 2.9% |
| 7 | 108 | 2.2% |
| 8 | 86 | 1.7% |
| 9 | 84 | 1.7% |
| Other values (25) | 241 | 4.8% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 675 | 13.5% |
| 1 | 2103 | 42.1% |
| 2 | 672 | 13.4% |
| 3 | 388 | 7.8% |
| 4 | 291 | 5.8% |
| 5 | 207 | 4.1% |
| 6 | 145 | 2.9% |
| 7 | 108 | 2.2% |
| 8 | 86 | 1.7% |
| 9 | 84 | 1.7% |

| Value | Count | Frequency (%) |
|---|---|---|
| 40 | 1 | < 0.1% |
| 39 | 2 | < 0.1% |
| 38 | 2 | < 0.1% |
| 36 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 31 | 3 | 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 2 | < 0.1% |
| 26 | 1 | < 0.1% |
| 25 | 6 | 0.1% |
UWords#
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3518 |
| Minimum | 0 |
| Maximum | 40 |
| Zeros | 4394 |
| Zeros (%) | 87.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 40 |
| Range | 40 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.74924233 |
|---|---|
| Coefficient of variation (CV) | 4.972263588 |
| Kurtosis | 204.0404334 |
| Mean | 0.3518 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.93275092 |
| Sum | 1759 |
| Variance | 3.05984873 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4394 | 87.9% |
| 1 | 325 | 6.5% |
| 2 | 100 | 2.0% |
| 3 | 49 | 1.0% |
| 4 | 36 | 0.7% |
| 6 | 20 | 0.4% |
| 5 | 16 | 0.3% |
| 8 | 16 | 0.3% |
| 7 | 14 | 0.3% |
| 10 | 4 | 0.1% |
| Other values (16) | 26 | 0.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4394 | 87.9% |
| 1 | 325 | 6.5% |
| 2 | 100 | 2.0% |
| 3 | 49 | 1.0% |
| 4 | 36 | 0.7% |
| 5 | 16 | 0.3% |
| 6 | 20 | 0.4% |
| 7 | 14 | 0.3% |
| 8 | 16 | 0.3% |
| 9 | 3 | 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 40 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 3 | 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
SlangWords#
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7808 |
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 2206 |
| Zeros (%) | 44.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9141764073 |
|---|---|
| Coefficient of variation (CV) | 1.170820194 |
| Kurtosis | 4.82699131 |
| Mean | 0.7808 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.728787084 |
| Sum | 3904 |
| Variance | 0.8357185037 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2206 | 44.1% |
| 1 | 2042 | 40.8% |
| 2 | 531 | 10.6% |
| 3 | 126 | 2.5% |
| 4 | 65 | 1.3% |
| 5 | 20 | 0.4% |
| 6 | 8 | 0.2% |
| 7 | 2 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2206 | 44.1% |
| 1 | 2042 | 40.8% |
| 2 | 531 | 10.6% |
| 3 | 126 | 2.5% |
| 4 | 65 | 1.3% |
| 5 | 20 | 0.4% |
| 6 | 8 | 0.2% |
| 7 | 2 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 2 | < 0.1% |
| 6 | 8 | 0.2% |
| 5 | 20 | 0.4% |
| 4 | 65 | 1.3% |
| 3 | 126 | 2.5% |
| 2 | 531 | 10.6% |
| 1 | 2042 | 40.8% |
| 0 | 2206 | 44.1% |
AvgWordLength
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 170 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.53747412 |
| Minimum | 1 |
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 8 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.460511998 |
|---|---|
| Coefficient of variation (CV) | 0.2637505776 |
| Kurtosis | 233.5974868 |
| Mean | 5.53747412 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 8.007463239 |
| Sum | 26746 |
| Variance | 2.133095296 |
| Monotonicity | Not monotonic |

| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 1817 | 36.3% |
| 6 | 1403 | 28.1% |
| 4 | 683 | 13.7% |
| 7 | 510 | 10.2% |
| 8 | 176 | 3.5% |
| 3 | 117 | 2.3% |
| 9 | 58 | 1.2% |
| 10 | 22 | 0.4% |
| 11 | 15 | 0.3% |
| 2 | 10 | 0.2% |
| Other values (7) | 19 | 0.4% |
| (Missing) | 170 | 3.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 3 | 0.1% |
| 2 | 10 | 0.2% |
| 3 | 117 | 2.3% |
| 4 | 683 | 13.7% |
| 5 | 1817 | 36.3% |
| 6 | 1403 | 28.1% |
| 7 | 510 | 10.2% |
| 8 | 176 | 3.5% |
| 9 | 58 | 1.2% |
| 10 | 22 | 0.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 53 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 3 | 0.1% |
| 12 | 7 | 0.1% |
| 11 | 15 | 0.3% |
| 10 | 22 | 0.4% |
| 9 | 58 | 1.2% |
| 8 | 176 | 3.5% |
IsCyberbullying
Boolean
HIGH CORRELATION
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 KiB |

| Value | Count | Frequency (%) |
|---|---|---|
| False | 2500 | 50.0% |
| True | 2500 | 50.0% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
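
The calculation described above can be sketched in plain Python. This is an illustrative sketch, not the code the report uses: the helper names `average_ranks`, `pearson_r`, and `spearman_rho` are hypothetical, and ties are assumed to receive the average of the ranks they span.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole block of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance over the product of standard deviations (1/n factors cancel)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Spearman's rho is Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]      # monotonic but nonlinear in x
rho = spearman_rho(x, y)     # perfect monotonic relationship
r = pearson_r(x, y)          # linear correlation is weaker
print(rho, r)
```

On this toy data ρ equals 1 while r stays below 1, which illustrates why ρ picks up nonlinear monotonic relationships that r understates.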
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
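
A minimal sketch of this calculation follows, together with a check of the location and scale invariance mentioned above. The function name `pearson_r` and the toy data are illustrative assumptions; the scale factors used are positive.

```python
import math

def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of covariance and standard deviations cancel, so sums suffice.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 5.0]

r_original = pearson_r(x, y)

# Shift and rescale each variable separately (positive scale factors).
x_shifted = [10 * a + 3 for a in x]
y_shifted = [0.5 * b - 7 for b in y]
r_transformed = pearson_r(x_shifted, y_shifted)

print(r_original, r_transformed)
```

Both calls return the same value (0.8 for this data), since shifting and positively rescaling a variable changes neither the sign nor the relative size of its deviations from the mean.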
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
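
The pair-counting definition above can be written out directly. The sketch below implements exactly that formula (often called tau-a); the function name `kendall_tau_a` is hypothetical. Library implementations such as `scipy.stats.kendalltau` default to a tie-adjusted variant, so results may differ from this sketch when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # Tied pairs (direction == 0) count toward neither group.
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

In this example five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6.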
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | Id | Text | IsRetweet | IsSelfMentioned | Retweets# | Favorites# | Hashtags# | Medias# | Mentions# | SenderId | SenderAccountYears | SenderFavorites# | SenderFollowings# | SenderFollowers# | SenderStatues# | SenderLocation | Emojis# | Punctuations# | UpperCaseLetter# | Letter# | Symbols# | Words# | TWords# | UWords# | SlangWords# | AvgWordLength | IsCyberbullying |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.123850e+18 | bir adam yanında çocuklaşan kadını fazladan sevmeli çünkü bu yalnızken hep güçlü göründüm izninle huzur bulduğum yerde biraz şımarmak istiyorum deme şeklidir perşembe | False | False | 59.0 | 1045 | 1 | 0 | 0 | 1.935601e+09 | 2020 | 0 | 0 | 0.0 | 0 | NaN | 0.0 | 9 | 5 | 144 | 0 | 23 | 5 | 0 | 0 | 6.0 | False |
| 1 | 1.161960e+18 | mağlup mu desem mahcup mu ama ikisi de değil ben garip sen güzel dünya umutlu öyle bir tuhafım bu akşamüstü sevgilim canavar götünü gibi iki yanım iki süngü ahmed arif perşembe | False | False | 3.0 | 157 | 1 | 0 | 0 | 1.935601e+09 | 2020 | 0 | 0 | 0.0 | 0 | na | 0.0 | 8 | 8 | 147 | 0 | 31 | 8 | 0 | 1 | 4.0 | False |
| 2 | 1.162600e+18 | günaydın iyi pazarlar allah acil şifalar versin inşallah daha da iyi olacak | False | False | 1.0 | 3 | 0 | 0 | 11 | 9.276140e+17 | 2020 | 0 | 0 | 0.0 | 0 | NaN | 3.0 | 0 | 2 | 64 | 2 | 12 | 2 | 0 | 0 | 5.0 | False |
| 3 | 1.163020e+18 | ve ahmet arif leyla sına seslenir sevdiğim çaresizliğimden gayrı hiçbir kabahatim yok benim aşına ekmeğine kahrına karanlığına özlemine umuduna kat beni pazar hayalettimde | False | False | 13.0 | 220 | 2 | 0 | 0 | 1.935601e+09 | 2020 | 0 | 0 | 0.0 | 0 | na | 0.0 | 19 | 16 | 150 | 0 | 23 | 14 | 0 | 0 | 6.0 | False |
| 4 | 1.157730e+18 | arkadaki sanal gerzek oyunun oynuyor | False | False | 950.0 | 12104 | 0 | 1 | 0 | 4.495931e+06 | 12 | 22554 | 32766 | 60281.0 | 25482 | turkey | 0.0 | 0 | 1 | 35 | 0 | 5 | 1 | 0 | 1 | 7.0 | False |
| 5 | 1.158390e+18 | ikea dolap montajında zorlanacağına eminim | False | False | NaN | 0 | 0 | 0 | 1 | 3.696241e+06 | 12 | 1047 | 457 | 334.0 | 1853 | istanbul | 0.0 | 1 | 1 | 38 | 0 | 5 | 1 | 0 | 0 | 7.0 | False |
| 6 | 1.158380e+18 | marmaris te yüksek ses ve gürültüye 637 bin lira ceza | False | False | 3.0 | 31 | 0 | 1 | 0 | 1.501621e+07 | 11 | 0 | 39 | 7131754.0 | 221392 | turkey | 0.0 | 1 | 1 | 44 | 0 | 10 | 1 | 0 | 0 | 4.0 | False |
| 7 | 1.158440e+18 | mercedes in takım patronu olsaydınız lewis hamilton ın takım arkadaşı olarak kim tercih ederdiniz | False | False | 1.0 | 13 | 0 | 0 | 0 | 1.631032e+07 | 11 | 3651 | 148 | 18548.0 | 68489 | turkey | 2.0 | 4 | 3 | 84 | 0 | 14 | 3 | 0 | 0 | 6.0 | False |
| 8 | 1.158680e+18 | bu çocuğu hala nasıl takip edebiliyorsunuz ya gösteriş budala resmenn | False | False | NaN | 0 | 0 | 0 | 1 | 1.588324e+07 | 11 | 14343 | 898 | 1985.0 | 2389 | ankara | 0.0 | 0 | 0 | 67 | 0 | 10 | 0 | 0 | 1 | 6.0 | True |
| 9 | 1.162630e+18 | pudingin soğumuş pürüzsüz yüzeyini öpmek zevk alınan ufak sapık | False | False | 3.0 | 39 | 0 | 0 | 0 | 1.662696e+07 | 11 | 10 | 5 | 2653596.0 | 16833 | istanbul | 0.0 | 3 | 0 | 61 | 1 | 9 | 0 | 0 | 1 | 6.0 | False |
Last rows
| | Id | Text | IsRetweet | IsSelfMentioned | Retweets# | Favorites# | Hashtags# | Medias# | Mentions# | SenderId | SenderAccountYears | SenderFavorites# | SenderFollowings# | SenderFollowers# | SenderStatues# | SenderLocation | Emojis# | Punctuations# | UpperCaseLetter# | Letter# | Symbols# | Words# | TWords# | UWords# | SlangWords# | AvgWordLength | IsCyberbullying |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4990 | 1.207290e+18 | tipitip dallama iÅŸte | False | False | 0.0 | 0 | 0 | 0 | 2 | 1.183810e+18 | 0 | 4749 | 217 | 62.0 | 659 | NaN | 0.0 | 0 | 1 | 20 | 0 | 3 | 1 | 0 | 1 | 6.0 | True |
| 4991 | 1.207300e+18 | boş bulunma ortamında barındırma puşt | False | False | 0.0 | 0 | 0 | 0 | 0 | 1.161600e+18 | 0 | 547 | 82 | 87.0 | 33 | mersin | 0.0 | 0 | 0 | 34 | 0 | 5 | 0 | 0 | 1 | 6.0 | True |
| 4992 | 1.207300e+18 | sen verme ak akÅŸam akÅŸam iti kopuÄŸu puÅŸt kuÅŸtu uÄŸraÅŸmayak amk kfkeke | False | False | 0.0 | 1 | 0 | 0 | 1 | 1.207000e+18 | 0 | 30 | 24 | 14.0 | 35 | NaN | 0.0 | 1 | 2 | 58 | 0 | 12 | 2 | 0 | 2 | 4.0 | True |
| 4993 | 1.207300e+18 | sizinki iman falan da degil basbayağı orospu tamam mi vicdanına soktugumun pustu | False | False | 0.0 | 0 | 0 | 0 | 2 | 1.119680e+18 | 0 | 83 | 63 | 2.0 | 579 | NaN | 0.0 | 1 | 2 | 72 | 0 | 12 | 2 | 0 | 1 | 6.0 | True |
| 4994 | 1.207300e+18 | fahişe olmuş ruhların vesikayla ispatı yoktur vesikası olan kendini namuslu sanırmış çomar | False | False | 0.0 | 1 | 0 | 0 | 2 | 1.121490e+18 | 0 | 17176 | 430 | 251.0 | 5907 | NaN | 0.0 | 1 | 2 | 79 | 0 | 12 | 2 | 0 | 1 | 6.0 | True |
| 4995 | 1.207300e+18 | sakalını sktiğimin moron milleti korkutmak için emniyeti mentliyen bebe siz avrupadan gelen fonları cebe atacaksınız diye bu ülke vatandaşları her gün intihar ediyor sen adres ver ben senin ziyaretine gelirim | False | False | 0.0 | 0 | 0 | 0 | 1 | 1.163550e+18 | 0 | 22452 | 836 | 804.0 | 2460 | intikam | 0.0 | 4 | 2 | 180 | 0 | 30 | 2 | 0 | 1 | 6.0 | True |
| 4996 | 1.207300e+18 | yavÅŸak salmadi hala serefsiz | False | False | 0.0 | 0 | 0 | 0 | 0 | 1.183450e+18 | 0 | 9053 | 208 | 212.0 | 4174 | day6 | 0.0 | 0 | 0 | 25 | 0 | 4 | 0 | 0 | 1 | 6.0 | True |
| 4997 | 1.207300e+18 | bu şerefsiz i biraz açmak için seçildiğinde yemin etti şerefinin ve namusunun üzerine herkese cumhurbaşkanı olmak için yaptımı hayır onun için şerefsiz namussuz diplomasız işgal ettiği yer için yalancı sahtekâr dolandırıcı konuşmalarında yaptıklarında vatan haini rte | False | False | 0.0 | 0 | 0 | 0 | 2 | 1.201560e+18 | 0 | 11 | 5 | 0.0 | 104 | NaN | 0.0 | 7 | 5 | 232 | 0 | 36 | 5 | 0 | 0 | 6.0 | True |
| 4998 | 1.207300e+18 | battal etmek fahişe gemleme bulaşıkhane tecavüb | False | False | 0.0 | 0 | 0 | 0 | 1 | 1.207070e+18 | 0 | 0 | 0 | 0.0 | 303 | NaN | 0.0 | 0 | 0 | 42 | 0 | 6 | 0 | 0 | 1 | 7.0 | True |
| 4999 | 1.207300e+18 | yine ortalık rahibe götünü fahişe kaynıyor bir yavaş öslxşeöod | False | False | 0.0 | 7 | 0 | 0 | 0 | 1.184090e+18 | 0 | 286 | 651 | 676.0 | 60 | adana | 0.0 | 0 | 1 | 57 | 0 | 9 | 1 | 0 | 2 | 6.0 | True |
Most frequently occurring
| | Id | Text | IsRetweet | IsSelfMentioned | Retweets# | Favorites# | Hashtags# | Medias# | Mentions# | SenderId | SenderAccountYears | SenderFavorites# | SenderFollowings# | SenderFollowers# | SenderStatues# | SenderLocation | Emojis# | Punctuations# | UpperCaseLetter# | Letter# | Symbols# | Words# | TWords# | UWords# | SlangWords# | AvgWordLength | IsCyberbullying | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.160730e+18 | merhaba ben kayseri den travesti hasret sevda görüşmeleri mi kendime ait evimde yapıyorum ne aradıgını biliyorsan ve ciddiysen eğer görüşelim müsaitim şuan 0541 691 29 19 bayan degilim 0541 691 29 19 | False | False | 0.0 | 0 | 0 | 1 | 0 | 7.315140e+17 | 3 | 91 | 1815 | 2533.0 | 6545 | kayseri | 0.0 | 9 | 18 | 168 | 0 | 32 | 8 | 2 | 1 | 5.0 | True | 2 |
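
A table of this kind can be produced by counting how often each full row occurs and keeping the rows seen more than once. The sketch below uses only the standard library and made-up toy rows; it is not the report's own implementation, and the variable names are illustrative.

```python
from collections import Counter

# Toy rows standing in for full records (every column is part of the key).
rows = [
    ("a", 1, True),
    ("b", 2, False),
    ("a", 1, True),
    ("c", 3, True),
]

# Count the occurrences of each distinct row.
counts = Counter(rows)

# Keep only rows that occur more than once, with their occurrence count.
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, "occurs", n, "times")
```

Rows must be hashable for this to work, which is why each record is stored as a tuple.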
| 1 | 1.207280e+18 | seni santim santim öperek sokarım sonra her milimini yalar amına dil atar zevke getiririm altıma alıp bacaklarını omuzuma dayarım yarağımı amına sokup dibine kadar köklerim altımda tamamen kavrayıp sert sert hızlı hızlı haşin haşin bağırta bağırta inlete inlete sikerim dm ye | False | False | 0.0 | 0 | 0 | 0 | 1 | 1.043120e+18 | 1 | 4808 | 966 | 90.0 | 1195 | istanbul | 0.0 | 6 | 7 | 235 | 0 | 41 | 6 | 1 | 4 | 5.0 | True | 2 |